Method for efficient retransmission timeout estimation in NACK-based protocols

ABSTRACT

Disclosed is a system and method for estimating retransmission timeout (RTO) in real-time streaming applications over the Internet between a server and a client. The present invention employs a retransmission timeout (RTO) in NACK-based applications to support multiple retransmission attempts per lost packet, wherein the RTO is estimated from an actual round-trip delay (RTT) and a smoothed inter-packet delay variance.

CROSS REFERENCE TO RELATED APPLICATION

[0001] This application claims the benefit of U.S. Provisional application Ser. No. 60/262,591 filed Jan. 18, 2001, the teachings of which are incorporated herein by reference.

BACKGROUND OF THE INVENTION

[0002] 1. Field of invention

[0003] The present invention relates to retransmission timeout (RTO) estimators, and particularly, to a system and method for estimating RTO in NACK-based real-time streaming applications that support multiple retransmissions of the same packet.

[0004] 2. Description of the Invention

[0005] In general, there are two types of Internet transport protocols that support lost packet recovery in a data communication network. The first approach is ACK-based as set forth under the transmission control protocol (TCP), which involves the receiver sending a positive acknowledgment (ACK) in response to each received packet. The second approach is NACK-based under a user datagram protocol (UDP), which involves the receiver sending a negative acknowledgment (NACK) in response to each lost packet.

[0006] Referring to FIG. 1(a), TCP utilizes a system of positive acknowledgments (ACK) for data arriving at the receiving endpoint as the mechanism for error recovery. This system operates under the principle that only unacknowledged frames should be retransmitted. To ensure that each packet is safely received at the receiving endpoint, TCP uses a retransmission timeout (RTO) mechanism by managing a retransmission timer for each connection. That is, TCP sets the retransmission timer and tracks an RTO value and a round-trip time (RTT) for the connection. The RTT is the time elapsed between the start of transmission of a TCP-type data segment and the receipt of an acknowledgment of that segment. If an acknowledgment is not received by the time RTO₁ expires, TCP retransmits the data again within the next RTO₂.

[0007] In contrast, UDP utilizes a system of negative acknowledgments (NACK) by forwarding a NACK packet to the sending source in response to a lost frame to request its retransmission, as shown in FIG. 1(b). However, the NACK packet itself can be lost along the path from the receiver to the sender. To this end, UDP utilizes a retransmission timeout mechanism similar to that of TCP.

[0008] It is important that the estimation of an RTO value is performed accurately. Normally, the RTO estimation is performed by predicting the next value of the RTT based on the previous samples of the RTT. If the RTO is overestimated, it leads to lower throughput performance in TCP and may cause an increased number of under-flow events in real-time applications. Yet, if the RTO is underestimated, the protocol generates a large number of duplicate packets that cause serious network congestion as more unnecessary packets are retransmitted.

[0009] A background of the current standard, which is based on TCP's retransmission timeout estimator, is described hereinafter. The standard consists of two algorithms described below. The first algorithm, the smoothed RTT estimator (SRTT), is based on an exponentially weighted moving average (EWMA) of the past RTT samples: $SRTT_{i} = \begin{cases} RTT_{0}, & i = 0 \\ (1 - \alpha) \cdot SRTT_{i-1} + \alpha \cdot RTT_{i}, & i \geq 1, \end{cases} \quad (1)$

[0010] where RTT_(i) represents the i-th sample of the round-trip delay produced at time t_(i) and α (set by default to ⅛) represents a smoothing factor that can be varied to give more or less weight to the history of RTT samples.

[0011] The second algorithm, the smoothed RTT variance estimator (SVAR), computes an approximation to the RTT variance using EWMA formulas similar to the ones described above: $SVAR_{i} = \begin{cases} RTT_{0}/2, & i = 0 \\ (1 - \beta) \cdot SVAR_{i-1} + \beta \cdot VAR_{i}, & i \geq 1, \end{cases} \quad (2)$

[0012] where β (set by default to ¼) represents an EWMA smoothing factor and VAR_(i) represents the absolute deviation of the i-th RTT sample from the smoothed average: VAR_(i)=|SRTT_(i−1)−RTT_(i)|.

[0013] Finally, the RTO is determined by multiplying the smoothed variance by four and adding it to the smoothed round-trip delay:

RTO(t)=SRTT_(i)+4*SVAR_(i),  (3)

[0014] where t represents the time at which the RTO is computed, and i=max{i: t_(i)≦t}.
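The background estimator of equations (1) through (3) can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `tcp_rto` and the use of seconds as the time unit are assumptions made here, while the default weights ⅛ and ¼ and the factor of four come from the text above.

```python
def tcp_rto(rtt_samples, alpha=1 / 8, beta=1 / 4):
    """Return the RTO of equation (3) after processing all RTT samples.

    rtt_samples: non-empty sequence of round-trip delays (assumed seconds).
    """
    if not rtt_samples:
        raise ValueError("at least one RTT sample is required")

    srtt = rtt_samples[0]       # equation (1), case i = 0
    svar = rtt_samples[0] / 2   # equation (2), case i = 0

    for rtt in rtt_samples[1:]:
        # VAR_i uses the previous smoothed average SRTT_(i-1),
        # so it must be computed before SRTT is updated.
        var = abs(srtt - rtt)
        svar = (1 - beta) * svar + beta * var
        srtt = (1 - alpha) * srtt + alpha * rtt

    return srtt + 4 * svar      # equation (3)


if __name__ == "__main__":
    print(tcp_rto([0.100, 0.120, 0.090]))
```

Note that the deviation is taken against the smoothed average from the previous step, which is why the variance update precedes the average update in the loop.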

[0015] In real-time streaming applications, e.g., multimedia applications, NACK-based operation is preferred due to a lower overhead along the path from the receiver to the sender and potentially faster recovery of lost packets. However, the RTO estimator, as described in the preceding paragraphs, is typically suitable only for the ACK-based applications and is not applicable to NACK-based protocols by design. It produces an extended number of duplicate packets and causes unnecessary delays in the generation of the subsequent NACK requests in real-time streaming applications due to poor prediction of the next RTT value. In addition, NACK-based protocols do not have a common RTO estimation scheme that works well in heterogeneous Internet conditions. Despite these drawbacks, many NACK-based protocols are still utilizing the existing RTO estimating protocol, which is borrowed from TCP.

[0016] As described above, an RTO estimator is characterized by two parameters: the number of duplicate packets and the amount of unnecessary timeout waiting. However, these two parameters cannot be minimized at the same time as they represent a basic trade-off of the estimator (i.e., decreasing one parameter will increase the other). Since TCP's RTO estimator proves to be inapplicable in NACK-based protocols, there is a need for such protocols to employ the class of optimal RTO estimators, which are described in this patent disclosure.

SUMMARY OF THE INVENTION

[0017] The present invention is directed to a method and system for estimating retransmission timeout (RTO) in real-time streaming applications over the Internet between a server and a client.

[0018] The present invention provides a method of estimating retransmission timeout (RTO_(J)) used in a communication system to support multiple retransmissions, the method including the steps of: transmitting a plurality of data packets from a server to a client; transmitting a negative acknowledgment (NACK) packet for retransmission by the client if one of the data packets is missing; computing a round-trip delay (RTT_(i)) corresponding to a latency between sending the NACK packet to the server and receiving the corresponding retransmission of the missing packet from the server; calculating a plurality of samples of delay (Δ_(j)) between the reception of adjacent packets of the plurality of data packets by the client; determining a smoothed inter-packet delay variance (SVARΔ_(j)) based on the calculated delay samples; and, computing the RTO_(J) based on the determined RTT_(i) and the determined smoothed inter-packet delay variance.

[0019] The present invention provides a system of managing transmission of a plurality of data packets over a communications link between a server system and a client system and includes: a means for receiving the data packets in the form of frames comprised of packets; a means for determining whether any frame packets were lost during transmission; a means for requesting that any lost frame packets be retransmitted; a means for determining a round-trip delay (RTT_(i)) corresponding to a latency between requesting retransmission of the lost frame to the server and receiving the corresponding retransmission of the lost frame from the server; a means for determining inter-burst packet delay variations; and, a means for determining a retransmission timeout (RTO_(J)) based on the determined RTT and the determined inter-burst delay variations.

[0020] These and other advantages will become apparent to those skilled in this art upon reading the following detailed description in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0021] FIG. 1(a) illustrates representative data flows in the TCP communication environment;

[0022] FIG. 1(b) illustrates representative data flows in the UDP communication environment;

[0023] FIG. 2 illustrates a block diagram of a system according to the present invention;

[0024] FIG. 3 illustrates the various layers that make up the Transmission Control Protocol/Internet Protocol (TCP/IP);

[0025] FIG. 4(a) illustrates the format of a user datagram protocol (UDP) packet at the server end in accordance with the present invention;

[0026] FIG. 4(b) illustrates the format of a user datagram protocol (UDP) packet at the client end in accordance with the present invention;

[0027] FIG. 5 illustrates a time chart depicting the jitter-based retransmission timeout (RTO) estimation according to the present invention; and,

[0028] FIG. 6 is a flow chart illustrating the operation of the retransmission timeout (RTO) estimator according to the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

[0029] In the following description, for purposes of explanation rather than limitation, specific details are set forth such as the particular architecture, interfaces, techniques, etc., in order to provide a thorough understanding of the present invention. However, it will be apparent to those skilled in the art that the present invention may be practiced in other embodiments which depart from these specific details. Moreover, for the purpose of clarity, detailed descriptions of well-known devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.

[0030] According to an embodiment of the present invention, a mechanism for controlling the retransmission of data packets in a digital communication environment is provided. Referring to FIG. 2, a system 10 which uses the invention comprises a first system 12, such as a server device, and a second system 14, such as a client device, which are in communication with each other via an access link of the network 16. Preferably, the inventive retransmission mechanism is placed at the client system. As shown in FIG. 2, the present invention can be practiced in a client-server environment, but the client-server environment is not essential.

[0031] In this invention, the server system 12 sends at least one source packet or sends packets in bursts to the client system 14 over the network. However, in the event that the source packet or burst packets sent from the server system 12 to the client system 14 are received in error or lost, the client system 14 transmits a negative acknowledgment (NACK) packet to the server system 12 to request retransmission. Then, the client system 14 establishes a limit on the timer period and retransmits the NACK packet to the server system 12 if the requested packet or burst packets are not received within the specified time period.

[0032] It should be noted that many real-time streaming servers are implemented to transmit their data in bursts of packets instead of sending one packet every specified period. This type of burst transmission typically reduces the overhead associated with frequent switching between processes. In addition, bursty packet transmission is better adapted to handle varying packet sizes and allows more simultaneous streams per server. However, burst transmission is not required.

[0033] According to an embodiment of the present invention, packets that are received in error or lost are reported back to the server system 12 by the client system 14 via a NACK packet. Here, a user datagram protocol (UDP) is utilized. FIG. 3 depicts the various layers that make up the Transmission Control Protocol/Internet Protocol (TCP/IP) suite. Basically, TCP provides end-to-end transport services across multiple heterogeneous networks and the delivery of sequenced packets of information across the Internet. UDP is a connection-less transport protocol designed to operate using the service of IP and provides minimal error detection for streams of information. At the network level, IP provides a “datagram” delivery service.

[0034] The format of a UDP packet according to the present invention is shown in FIG. 4(a) and FIG. 4(b). Each packet in a real-time application carries a burst identifier, which allows the receiver to distinguish packets from different bursts. Referring to FIG. 1(b), a NACK packet is sent to the server system if the source packet therefrom is lost along the transmission path. The loss of packets is detected by the client system 14 through gaps in sequence numbers. For each NACK packet transmitted, the inventive protocol maintains a timer. If the timer expires, the NACK packet is retransmitted. To avoid confusion as to which retransmission of the same packet actually returned to the client system, the header of each NACK packet contains an extra field specifying the retransmission sequence count in addition to the lost packet sequence number, as shown in FIG. 4(b). Thus, the client system can pair each retransmitted packet with the exact time when the corresponding NACK packet was sent out and properly measure the RTT.
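The pairing of a retransmitted packet with the NACK that requested it can be sketched as follows. This is a minimal Python sketch, not the disclosed implementation: the class name `NackTracker`, its method names, and the assumption that the server echoes the retransmission sequence count back in the retransmitted packet are all hypothetical choices made for illustration.

```python
class NackTracker:
    """Pairs each retransmitted packet with the NACK that requested it."""

    def __init__(self):
        self._sent = {}      # (lost_seq, attempt) -> time the NACK was sent
        self._attempts = {}  # lost_seq -> number of NACKs sent so far

    def send_nack(self, lost_seq, now):
        """Record a NACK and return the two header fields it would carry."""
        attempt = self._attempts.get(lost_seq, 0)
        self._attempts[lost_seq] = attempt + 1
        self._sent[(lost_seq, attempt)] = now
        return lost_seq, attempt

    def on_retransmission(self, lost_seq, attempt, now):
        """Return an RTT sample, or None if no matching NACK is pending.

        Assumes the retransmitted packet echoes the attempt count.
        """
        sent_at = self._sent.get((lost_seq, attempt))
        if sent_at is None:
            return None
        # The packet is recovered: forget every NACK sent for it.
        for a in range(self._attempts.pop(lost_seq, 0)):
            self._sent.pop((lost_seq, a), None)
        return now - sent_at


if __name__ == "__main__":
    tracker = NackTracker()
    tracker.send_nack(7, now=1.0)   # first request for packet 7
    tracker.send_nack(7, now=1.5)   # timer expired, second request
    print(tracker.on_retransmission(7, attempt=1, now=1.8))
```

Keying the table by both the sequence number and the attempt count is what removes the ambiguity: the RTT sample is measured from the second NACK, not the first, when the reply to the second NACK is the one that arrives.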

[0035] As the source packets are being transmitted over a path with unpredictable delay, the present invention continuously adjusts the threshold at which the retransmit timer expires. That is, the transmission path changes during the lifetime of the connection, and the state of the routers (or switches) also changes as more or less traffic is being carried by the network. Accordingly, the present invention incorporates a new round-trip estimation mechanism that can be used to determine more accurate timing in retransmitting the NACK packet. Unlike the prior art, an estimate of the delay jitter between arriving packets is used in the present invention as the basis to set the retransmit timer threshold.

[0036] The following is a detailed description of specific algorithms of a retransmission mechanism according to the present invention. In real-time multimedia applications, the server system 12 typically sends packets in bursts for a duration of time D_(b). Here, D_(b) is based on the streaming rate and the average packet size. Referring to FIG. 5, for each burst j, the last packet of the burst arrives at the client at time t_(j) ^(last), and the first packet of the burst arrives at time t_(j) ^(first). Thus, the inter-burst delay for burst j can be defined as in equation (4) below:

Δ_(j) =t _(j) ^(first) −t _(k) ^(last),  (4)

[0037] where burst k represents the last burst received before burst j (unless there is packet loss, k=j−1). For each burst j, using EWMA formulas similar to those in TCP, the smoothed inter-burst delay SΔ_(j) and smoothed inter-burst delay variance SVARΔ_(j) are computed as defined in the following equations (5) and (6): $S\Delta_{j} = \begin{cases} \Delta_{0}, & j = 0 \\ (1 - \alpha_{1}) \cdot S\Delta_{j-1} + \alpha_{1} \cdot \Delta_{j}, & j \geq 1, \end{cases} \quad (5)$ and $SVAR\Delta_{j} = \begin{cases} \Delta_{0}/2, & j = 0 \\ (1 - \beta_{1}) \cdot SVAR\Delta_{j-1} + \beta_{1} \cdot VAR\Delta_{j}, & j \geq 1, \end{cases} \quad (6)$

[0038] where α₁ and β₁ represent exponential weights and VARΔ_(j) represents the absolute deviation of Δ_(j) from its smoothed version SΔ_(j−1). Here, SΔ_(j) is typically proportional to the burst duration D_(b), and thus it cannot be used the same way in real-time applications with a different burst duration. However, the smoothed variance SVARΔ_(j) is fairly independent of the burst duration and reflects the variation in the amount of cross traffic in the router queues along the path from the server to the client.
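The inter-burst delay of equation (4) and the two smoothed quantities of equations (5) and (6) can be sketched as a small stateful estimator. This is a hedged illustration only: the class name `InterBurstJitter`, the attribute names, and the labels `alpha1` and `beta1` for the two exponential weights are choices made here, and the default weights 0.5 and 0.25 are the values the description later reports as optimal.

```python
class InterBurstJitter:
    """Tracks the smoothed inter-burst delay and its smoothed variance."""

    def __init__(self, alpha1=0.5, beta1=0.25):
        self.alpha1 = alpha1     # weight of equation (5)
        self.beta1 = beta1       # weight of equation (6)
        self.s_delta = None      # smoothed inter-burst delay
        self.svar_delta = None   # smoothed inter-burst delay variance

    def update(self, t_first, t_prev_last):
        """Feed one burst boundary and return the updated smoothed variance.

        t_first:     arrival time of the first packet of the new burst
        t_prev_last: arrival time of the last packet of the previous burst
        """
        delta = t_first - t_prev_last            # equation (4)
        if self.s_delta is None:                 # first sample, j = 0
            self.s_delta = delta
            self.svar_delta = delta / 2
        else:
            # Deviation from the previous smoothed delay, so compute it
            # before the smoothed delay itself is updated.
            var = abs(delta - self.s_delta)
            self.svar_delta = ((1 - self.beta1) * self.svar_delta
                               + self.beta1 * var)
            self.s_delta = ((1 - self.alpha1) * self.s_delta
                            + self.alpha1 * delta)
        return self.svar_delta


if __name__ == "__main__":
    est = InterBurstJitter()
    est.update(t_first=1.0, t_prev_last=0.9)
    print(est.update(t_first=2.0, t_prev_last=1.8))
```

Only the smoothed variance is returned because, as the paragraph above explains, the smoothed delay itself scales with the burst duration and is kept solely to compute the next deviation.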

[0039] With the transmission delay and its delay variation from equation (6), if T_(j) is the time when the client produced the j-th sample of the inter-burst delay Δ_(j) (ideally, T_(j) equals t_(j) ^(first)) and t_(i) is the time when the client computed the i-th RTT sample RTT_(i) (explained later), then the effective jitter-based RTO according to the present invention at time t is:

RTO_(J)(t)=n*RTT_(i)+m*SVARΔ_(j),  (7)

[0040] where i=max{i: t_(i)≦t} and j=max{j: T_(j)≦t}.

[0041] Furthermore, in the event that there is a longer delay between the measurements of the RTT, a slight modification to equation (7) can be provided to better approximate the RTO. This better estimator, called RTO_(JD), can be created by incorporating the duration between the time of the last RTT sample (i.e., t_(i)) and the time at which the RTO is being estimated (i.e., t) into the RTO_(J) estimator:

RTO_(JD)(t)=(n+k(t−t_(i)))*RTT_(i)+m*SVARΔ_(j),  (8)

[0042] where i=max{i: t_(i)≦t}, j=max{j: T_(j)≦t}, and the time units for t and t_(i) are seconds.

[0043] It should be noted that both jitter-based RTO estimators, as described in the preceding paragraphs, achieve optimality when α₁=0.5, β₁=0.25, k=0.5, and m=4.2792*n−2.6646. The remaining free parameter n can be used to vary the desired number of duplicate packets on a per-application basis: higher values of n correspond to fewer duplicate packets. The recommended values of n are between 1 and 4. It should be noted that frequent delay jitter samples prove to be very helpful in fine tuning NACK-based RTO estimation and can be used as a good predictor of the changes in the future RTTs.
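Equations (7) and (8), together with the parameter relation stated above, can be sketched as three small Python functions. The function names and the default n=2.0 are assumptions made here for illustration (the text only recommends a range for n); the constants 4.2792, 2.6646 and k=0.5 are taken from the paragraph above.

```python
def optimal_m(n):
    """Weight of the smoothed jitter variance for a given n."""
    return 4.2792 * n - 2.6646


def rto_j(rtt, svar_delta, n=2.0, m=None):
    """Jitter-based RTO of equation (7).

    rtt:        latest RTT sample, in seconds
    svar_delta: latest smoothed inter-burst delay variance, in seconds
    """
    if m is None:
        m = optimal_m(n)
    return n * rtt + m * svar_delta


def rto_jd(rtt, svar_delta, t, t_i, n=2.0, k=0.5, m=None):
    """RTO of equation (8), which grows with the age of the RTT sample.

    t:   time at which the RTO is estimated, in seconds
    t_i: time at which the latest RTT sample was taken, in seconds
    """
    if t < t_i:
        raise ValueError("t must not precede the time of the RTT sample")
    if m is None:
        m = optimal_m(n)
    return (n + k * (t - t_i)) * rtt + m * svar_delta


if __name__ == "__main__":
    print(rto_j(rtt=0.100, svar_delta=0.010, n=1.0))
    print(rto_jd(rtt=0.100, svar_delta=0.010, t=12.0, t_i=10.0, n=1.0))
```

With a two-second-old RTT sample and k=0.5, the second call multiplies the RTT by n+1 rather than n, which illustrates how the estimator becomes more conservative as the RTT sample ages.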

[0044] It should be noted that the estimator of the present invention for determining the retransmission timeout (RTO) can be realized using a processor, microcomputer, an application-specific integrated circuit (ASIC), a programmable device, or any other device designed and operated to provide the functionality described herein. A flow chart of a key operation of the estimator is shown in FIG. 6, as hereinafter explained.

[0045] Referring to FIG. 6, each packet is fed into an estimator algorithm that tracks two quantities: the round-trip delay estimate (RTT) and the variance in inter-burst delay jitter (SVARΔ). In step 600, each packet is received at the client system. If there are missing packets, a NACK packet for each missing packet is sent to the server system in step 610. In such a case, the transmission time of each NACK packet requesting a retransmission of packet (i), nack_(i), is recorded, and then the timer to transmit the subsequent NACK packet is set in step 610. Meanwhile, if retransmission of the data packet is reliably completed from the server to the client system, the round-trip delay (RTT) is computed in step 620.

[0046] According to the embodiment of the present invention, the receiver in a real-time session must periodically measure the round-trip delay. The client system obtains the RTT measurements by utilizing packet loss to measure the round-trip delay: each successfully recovered packet provides a sample of the RTT. That is, the RTT is the duration between sending a NACK and receiving the corresponding retransmission. Alternatively, the client obtains additional samples of the round-trip delay in cases when network packet loss is too low. To this end, the client periodically transmits simulated retransmission requests to the server if packet loss falls below a certain threshold. In response to these simulated NACKs, the server sends the requested packets to the client.
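The decision to send a simulated retransmission request can be sketched as a simple predicate. The text does not give a loss threshold or a probing period, so the function name, both parameter names, and both default values below are hypothetical placeholders rather than values from the disclosure.

```python
def should_send_simulated_nack(now, last_rtt_sample_time, loss_rate,
                               loss_threshold=0.01, probe_interval=5.0):
    """Decide whether the client should probe the RTT with a simulated NACK.

    now, last_rtt_sample_time: times in seconds
    loss_rate:      observed fraction of lost packets (0.0 to 1.0)
    loss_threshold: hypothetical loss rate below which real losses are
                    too rare to yield enough RTT samples
    probe_interval: hypothetical minimum spacing between RTT samples
    """
    too_few_losses = loss_rate < loss_threshold
    sample_is_stale = (now - last_rtt_sample_time) >= probe_interval
    return too_few_losses and sample_is_stale


if __name__ == "__main__":
    print(should_send_simulated_nack(now=10.0, last_rtt_sample_time=2.0,
                                     loss_rate=0.001))
```

Under these assumed defaults, a client that has seen almost no loss and has not measured the RTT for several seconds would issue a probe, while a client experiencing regular loss would rely on its real NACKs alone.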

[0047] In step 630, it is determined whether the received packet belongs to the same burst as the previously received packet. If it is different, in step 640, the inter-burst delay is computed, as described in equation 4. The inter-burst delay is measured between the receipt of the first packet of the burst and the last packet of the previous burst at the client side. To distinguish between different bursts and utilize equation (4), the system records the parameters of the last received packet in step 650.
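Steps 630 through 650 can be sketched as a single pass over the arriving packets. This is an illustrative Python sketch under the assumption that each arrival is available as an (arrival time, burst identifier) pair; the function name and that tuple layout are hypothetical.

```python
def inter_burst_delays(arrivals):
    """Return the inter-burst delay samples for a stream of arrivals.

    arrivals: iterable of (arrival_time, burst_id) pairs in arrival order.
    """
    delays = []
    last_time = None    # parameters of the last received packet (step 650)
    last_burst = None
    for arrival_time, burst_id in arrivals:
        # Step 630: does this packet start a new burst?
        if last_burst is not None and burst_id != last_burst:
            # Step 640: first packet of this burst minus last packet
            # of the previously received burst.
            delays.append(arrival_time - last_time)
        last_time, last_burst = arrival_time, burst_id
    return delays


if __name__ == "__main__":
    packets = [(0.00, 0), (0.01, 0), (0.02, 0),
               (0.50, 1), (0.51, 1),
               (1.03, 2)]
    print(inter_burst_delays(packets))
```

Because the comparison is against the last burst actually received, a burst that is lost entirely simply lengthens the next delay sample, which matches the definition of burst k as the last burst received before burst j.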

[0048] Next, the inter-burst delay samples are averaged into a smoothed inter-burst delay (SΔ) estimate, which is then used to control the retransmission timeout parameter (RTO). Using step 660, for each burst, the smoothed inter-burst delay and the smoothed inter-burst delay variance are calculated in steps 670 and 680, respectively. Step 670 is performed to update the smoothed inter-burst delay value, which is used for determining the variance in the subsequent calculation process. These steps are executed according to equations (5) and (6). Hence, as each new sample is added, the mean and variance change.

[0049] Finally, the retransmission timeout (RTO), which is a timeout to prompt retransmission of unrecovered data, is calculated in step 690. The latest RTT sample has the most relevance to the value of the future round-trip delay due to the large spacing between RTT samples in NACK-based applications. Upon expiration of the timer for packet (i), the client system 14 retransmits the NACK packet, nack_(i), and sets the timer for another RTO time unit for packet (i). The recommended values of n are between 0 and 4, and the value of m is set to: m=4.2792*n−2.6646.
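The timer behavior described above (re-send the NACK on expiry and re-arm the timer for another RTO) can be sketched with a priority queue. This is a minimal Python sketch, not the disclosed implementation: the class name `NackTimers`, its methods, and the choice of a heap are assumptions made for illustration, and the RTO value is supplied by the caller.

```python
import heapq


class NackTimers:
    """Per-packet NACK timers that re-arm themselves on expiry."""

    def __init__(self):
        self._heap = []       # (deadline, seq), ordered by deadline
        self._deadline = {}   # seq -> currently valid deadline

    def arm(self, seq, now, rto):
        """Start (or restart) the timer for a lost packet."""
        deadline = now + rto
        self._deadline[seq] = deadline
        heapq.heappush(self._heap, (deadline, seq))

    def cancel(self, seq):
        """Stop the timer once the packet has been recovered."""
        self._deadline.pop(seq, None)

    def expire(self, now, rto):
        """Return packets whose NACK must be re-sent, re-arming each one."""
        due = []
        while self._heap and self._heap[0][0] <= now:
            deadline, seq = heapq.heappop(self._heap)
            # Skip entries that were cancelled or superseded by a re-arm.
            if self._deadline.get(seq) == deadline:
                due.append(seq)
        for seq in due:
            self.arm(seq, now, rto)
        return due


if __name__ == "__main__":
    timers = NackTimers()
    timers.arm(seq=5, now=0.0, rto=0.2)
    timers.arm(seq=6, now=0.0, rto=0.2)
    timers.cancel(6)                       # packet 6 was recovered
    print(timers.expire(now=0.25, rto=0.2))
```

In a fuller client the `rto` argument would be recomputed from the latest RTT sample and jitter variance each time a timer is armed, so that later attempts for the same packet use the most recent estimate.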

[0050] In summary, the present invention provides a new RTO estimation mechanism, which achieves significant performance improvements (i.e., fewer duplicate packets and less unnecessary waiting time) over the existing RTO estimation algorithms when employed in NACK-based protocols. Having thus described a preferred embodiment for managing retransmission over a digital communications link, it should be apparent to those skilled in the art that certain advantages of the system have been achieved. The foregoing is to be construed as only being an illustrative embodiment of this invention. Thus, persons skilled in the art can easily conceive of alternative arrangements providing a functionality similar to this embodiment without any deviation from the fundamental principles or the scope of this invention.

What is claimed is:
 1. A method for estimating retransmission timeout (RTO_(J)) used in a communication system to support multiple retransmission of the same packet between a server and a client, the method comprising the steps of: (a) transmitting a plurality of data packets from said server to said client; (b) transmitting a negative acknowledgment (NACK) packet for retransmission by said client if one of said data packets is missing; (c) computing a round-trip delay (RTT_(i)) corresponding to a latency between sending said NACK packet to said server and receiving the corresponding retransmission of said missing packet from said server; (d) calculating a plurality of samples of delay (Δ_(j)) between the reception of adjacent packets of said plurality of data packets by said client; (e) determining a smoothed inter-packet delay variance (SVARΔ_(j)) based on said calculated delay samples; and, (f) computing said RTO_(J) based on said determined RTT_(i) and said determined smoothed inter-packet delay variance.
 2. The method of claim 1, further comprising the step of controlling retransmission of said NACK based on said computed RTO_(J), said computed RTO_(J) being a delay between subsequent transmissions of said NACK packet from said client to said server.
 3. The method of claim 1, wherein said SVARΔ_(j) is determined according to SVARΔ_(j)=(1−β₁)*SVARΔ_(j−1)+β₁*D wherein β₁ being set to 0.25 and D being the absolute difference of Δ_(j)−SVARΔ_(j−1).
 4. The method of claim 1, wherein said RTO_(J) is determined according to RTO_(J)=n*RTT_(i)+m*SVARΔ_(j) wherein n being set between 0 and 4 and m being set to m=4.2792*n−2.6646.
 5. The method of claim 1, wherein the communication link between said server and said client comprises at least one of a wireless communications link, a wired communication link, and the combination of a wired communication link and a wireless communications link.
 6. A method for managing transmission of a plurality of data packets over a communications link between a server system and a client system, the method comprising the steps of: (a) transmitting a plurality of burst packets from said server to said client; (b) transmitting a negative acknowledgment (NACK) packet for retransmission by said client if one of said burst packets is lost; (c) determining a round-trip delay (RTT_(i)) corresponding to the actual time between the transmitting of said NACK packet by said client and a determination by said client that said lost burst packet was transmitted successfully; (d) calculating a plurality of samples of inter-burst delay (Δ_(j)) between the reception of adjacent burst packets of said plurality of burst packets by said client; (e) determining a smoothed inter-burst delay variance (SVARΔ_(j)) based on said calculated inter-burst delay samples; and, (f) computing said RTO_(J) based on said determined RTT_(i) and said determined smoothed inter-burst delay variance.
 7. The method of claim 6, further comprising the step of controlling multiple retransmission of said NACK based on said computed RTO_(J), said computed RTO_(J) being a delay between subsequent transmissions of said NACK packet from said client to said server.
 8. The method of claim 6, wherein said SVARΔ_(j) is determined according to SVARΔ_(j)=(1−β₁)*SVARΔ_(j−1)+β₁*D wherein β₁ being set to 0.25 and D being the absolute difference of Δ_(j)−SVARΔ_(j−1).
 9. The method of claim 6, wherein said RTO_(J) is determined according to RTO_(J)=n*RTT_(i)+m*SVARΔ_(j) wherein n being set between 1 and 4 and m being set to m=4.2792*n−2.6646.
 10. The method of claim 6, wherein said communication link between said server and said client comprises at least one of a wireless communications link, a wired communication link, and the combination of a wired communication link and a wireless communications link.
 11. A system for estimating retransmission timeout (RTO) used in a communication system to support multiple retransmission of the same packet between a server system and a client system, comprising: means for controlling said multiple retransmissions of a data packet between said server system and said client system over said communication link based on an actual round-trip delay (RTT) and a smoothed inter-packet delay variance (SVARΔ_(j)) associated with said client system, said RTT being a latency between sending a negative acknowledgment (NACK) packet to said server system responsive to a lost packet and receiving the corresponding retransmission of said lost packet from said server, said smoothed inter-packet delay variance (SVARΔ_(j)) being variation of delays before and after each received packet or burst of packets, whereby the over-estimation and under-estimation of said RTO is relatively minimized.
 12. A system for managing transmission of a plurality of data packets over a communications link between a server system and a client system, comprising: means for receiving said data packets in the form of frames comprised of packets; means for determining whether any frame packets were lost during transmission; means for requesting that any lost frame packets be retransmitted; means for determining a round-trip delay (RTT_(i)) corresponding to a latency between requesting retransmission of said lost frame to said server and receiving the corresponding retransmission of said lost frame from said server; means for determining inter-burst packet delay variations; and, means for determining a retransmission timeout (RTO_(J)) based on said determined RTT and said determined inter-burst delay variations.
 13. The system of claim 12, wherein said means for determining said RTO_(J) further comprises a means for determining an inter-burst delay (Δ_(j)) between the reception of a first packet of said lost burst packets and a last packet of a prior burst packets; and, a means for determining a smoothed inter-burst delay variance (SVARΔ_(j)).
 14. The system of claim 12, further comprising a means for controlling multiple retransmission of said NACK based on said computed RTO_(J), said computed RTO_(J) being a delay between subsequent transmissions of said NACK packet from said client to said server.
 15. The system of claim 12, wherein said SVARΔ_(j) is determined according to SVARΔ_(j)=(1−β₁)*SVARΔ_(j−1)+β₁*D wherein β₁ being set to 0.25 and D being the absolute value of Δ_(j)−SVARΔ_(j−1).
 16. The system of claim 12, wherein said RTO_(J) is determined according to RTO_(J)=n*RTT_(i)+m*SVARΔ_(j) wherein n being set between 1 and 4 and m being set to m=4.2792*n−2.6646.